Precision of Rough Set Clustering
نویسندگان
چکیده
Conventional clustering algorithms categorize an object into precisely one cluster. In many applications, the membership of some of the objects to a cluster can be ambiguous. Therefore, an ability to specify membership to multiple clusters can be useful in real world applications. Fuzzy clustering makes it possible to specify the degree to which a given object belongs to a cluster. In Rough set representations, an object may belong to more than one cluster, which is more flexible than the conventional crisp clusters and less verbose than the fuzzy clusters. The unsupervised nature of fuzzy and rough algorithms means that there is a choice about the level of precision depending on the choice of parameters. This paper describes how one can vary the precision of the rough set clustering and studies its effect on synthetic and real world data sets.
منابع مشابه
Hierarchical clustering algorithm for categorical data using a probabilistic rough set model
Several clustering analysis techniques for categorical data exist to divide similar objects into groups. Some are able to handle uncertainty in the clustering process, whereas others have stability issues. In this paper, we propose a new technique called TMDP (Total Mean Distribution Precision) for selecting the partitioning attribute based on probabilistic rough set theory. On the basis of thi...
متن کاملRough set with Effective Clustering Method
Rough set theory is a powerful mathematical tool that has been applied widely to extract knowledge from many databases .Rough set theory is proposed to mine rules from the Data warehouse. It constructs concise classification rules for each concept satisfying the given classification accuracy. Due to some drawbacks we suggest rough set with clustering methods to achieve more precision and Shows ...
متن کاملMining fuzzy rules from quantitative data based on the Variable Precision Rough Set and Rough Fuzzy Set Theories
In this paper, a different approach to extract the threshold value β of Variable Precision Rough Set (VPRS) applied to continuous information systems is presented. This study combines the Fuzzy Set and Rough Fuzzy Set (RFS) theories to determine the β value of VPRS. The β value was determined by the Fuzzy C-means and relevant Fuzzy theories, for the reason that errors of system classification c...
متن کاملRough Document Clustering and The Internet
Searching for information on the web has attracted many research communities. Due to the enormous size of the web and low precision of user queries, finding the right information from the web is the difficult or even impossible task. Clustering, one of the most the fundamental tools in Granular Computing (GrC), offers an interesting approach to this problem. By grouping of similar documents, cl...
متن کاملIdentification of Reliable Information for Classification Problems
A novel information identification model is proposed to support accurate classification tasks with mixtures of categorical and real-valued attributes. This model combines the advantages of rough set theory and cluster validity method to promote the classification quality to the higher levels. Real-valued attribute values are pre-processed by fuzzy c-means clustering method and then analyzed by ...
متن کامل